Research that powers transparent, reliable, and effective AI models.

Al is being rapidly being adopted by people in nearly every industry. Frontier labs are fine-tuning their models for higher quality outputs, but the industry still does not have a deep understanding of why Al models say what they say.

This lab is focused on scaling the interpretability research necessary to make better AI systems possible.